A voice-following video system (100) for dynamically capturing
the view of an active speaker includes a video camera (102), a
position control system (106) for the video camera, and one or
more voice-activated emitters (104). A voice-activated emitter
(104) is associated with each potential speaker. As a person
talks, the associated voice-activated emitter transmits a
position signal. The position control system (106) uses the
position signal as a beacon to determine the location of the
active speaker. The position control system then adjusts the pan
and/or tilt of the video camera (102) to bring the active speaker
into the field of view and tracks that person's movements. In
this manner, as one person stops speaking and another starts, the
video camera is automatically repositioned, capturing the view of
the active speaker.
Background of the Invention
Field of the Invention: 

The present invention relates to video systems. More
particularly, the present invention relates to voice-following
video systems which dynamically follow the active speaker among a
group of individuals.

Related Art: 

Video systems are typically employed in a multitude of
situations. Common applications include the recording or
broadcast of business presentations, lectures, seminars, and the
recording of social events. Additionally, video conferencing
between remote locations has become an increasingly popular
alternative to having off-site meetings.

One historic problem associated with video systems has been
providing low cost, automated means to dynamically and gracefully
switch the video camera's field of view. In a limited number of
applications, such as the recording of a lecturer in a classroom,
a static field of view is adequate. Most applications, however,
require the ability to change the camera's field of view between
individuals, charts, or visual aids. Group video conferencing,
which requires the simulation of face-to-face communications with
the local and remote participants, is an illustrative example of
an environment where versatile camera positioning is imperative.

Generally, both manual and automated techniques have been
utilized to address this problem. Manual switching between fields
of view has proven to be neither pragmatic nor satisfactory. In
some applications, a technician has been employed to control the
pan, tilt, and zoom of a single video camera. In other cases,
several fixed cameras are used and a technician manually switches
control between the different cameras. The use of technicians is
cost prohibitive and does not allow for spontaneous conferencing,
since the technician's time in most cases must be scheduled in
advance. Moreover, it is often disturbing to the participants to
have a technician roam about the meeting with a video camera.

Accordingly, conventional video systems typically go to great
lengths to circumvent the use of technicians by allowing users to
manually control the position of the camera with hand-held
controllers for the pan, tilt, and zoom functions. Additionally,
some systems provide a number of location presets. The user first
programs the presets and then, with a one-button control,
positions the camera between these predefined locations. These
approaches have drawbacks. Chief among them is that the users are
burdened with the distractive task of choreographing the event or
meeting. In addition, the users cannot be permitted to move
outside these preset locations; that is, free movement is
significantly restricted.

Automated solutions also have notable limitations. One such
solution involves acoustic detection of voice energy with camera
positioning. For example, U.S. Patent No. 4,581,758 to Coker
discloses the use of spatially separated microphones for
obtaining the location of a sound source which can be used for
camera pointing. One of the problems common to acoustic detection
is distinguishing between sounds originating in the room and
those being produced from the far end. Such a system may tend to
point to the conferencing audio speakers when a distant
participant is speaking. Accordingly, systems which focus
exclusively on voice energy detection have been only marginally
successful.

ParkerVision, Inc., 8493 Baymeadows Way, Jacksonville, FL, has
marketed several camera positioning products. Their CameraMan®
products allow a video camera to automatically track the
movements of a person. The CameraMan® is a swivel base upon which
a video camera is placed. This base is programmed to track the
movement of a person with a hand-held or wearable wand. The wand
acts as an infrared electronic beacon for the base to follow by
panning the camera. In lieu of following the wand, the
CameraMan® also provides position presets. The field of view of
the camera may be manually (by remote control) or automatically
cycled between these preset locations.

The wand also incorporates a wireless microphone so that the
voice of a person holding the wand will be picked up and
transmitted, by VHF-TV frequencies, to the video camera.

The CameraMan® products and systems that employ similar infrared
tracking have fundamental limitations. Namely, these systems do
not provide the ability of capturing a conversation as it
dynamically evolves. In multiple-user applications the wand must
actually be passed from hand to hand as different people speak.
This is awkward and does not lend itself to natural flowing
conversations.

SUMMARY OF THE INVENTION 

The invention is a voice-following video system for dynamically
tracking an active speaker. This system includes a video camera,
a position control system for the video camera, and one or more
voice-activated emitters. The video camera is mechanically and
electrically coupled to the position control system. Through the
mechanical coupling, the position control system can pan and tilt
the video camera. Through the electrical coupling, the position
control system can command the video camera to zoom-in or
zoom-out.

Each voice-activated emitter is to be associated with a person
who is a potential speaker to be tracked by the video camera. For
example, each emitter may be associated with a participant of a
video conference. In the preferred embodiment, the emitters are
configured as badges which are worn on a conference participant's
clothing. When a participant is speaking, the emitter will detect
the sound or voice energy and will produce an infrared position
signal. The infrared position signal is received by the position
control system. The position control system determines the
position of the active emitter (using, for example, the geometric
principles of triangulation) and adjusts the pan and tilt of the
video camera to bring the speaking participant (i.e., the active
speaker) into the field of view of the video camera.

The position control system includes a general purpose computer,
an infrared receiving system, and a servo-controller. The
voice-activated emitter includes an audio detector circuit and a
transmitter circuit. The audio detector circuit includes
thresholding and tuning, which facilitate the rejection of
extraneous noises to make the emitter responsive only to the
voice energy of the person wearing the emitter.

In an alternate embodiment, the emitter may also encode the voice
energy of the speaker into the infrared position signal for
transmission to the position control system. To facilitate this,
the voice-activated emitter of this alternate preferred
embodiment further includes a carrier oscillator and a modulator
circuit. The carrier oscillator and the modulator circuit
modulate the voice energy of the speaker and provide a modulated
voice signal to the transmitter. The modulated voice signal is
then used to produce an infrared modulated position signal. In
this alternate preferred embodiment, the position control system
will decode the infrared position signal to reveal the encoded
voice energy. This eliminates the need for centrally located
microphones to pick up the voice energy of the conference
participants.

The voice-following video system of the invention may be operated
in one of two different modes. In an automatic mode, the
voice-following features of the invention maintain the camera
field of view on the active speaker. In a manual mode, a remote
control key pad allows manual positioning of the video camera.

The foregoing, and other features and advantages of the invention
will be apparent from the following, more particular description
of a preferred embodiment of the invention, as illustrated in the
accompanying drawings.



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a high level block diagram of a voice-following video
conferencing system 100;

Figure 2A illustrates an infrared voice-activated emitter badge
200 (front view);

Figure 2B illustrates an infrared voice-activated emitter badge
202 (rear view);

Figure 3 is a high level block diagram of a voice-activated
emitter 104;

Figure 4 is a detailed functional block diagram of a
voice-activated emitter 104;

Figure 5 is a functional block diagram of a position control
system 106;

Figure 6 is a detailed functional block diagram of a
voice-activated modulating emitter 600; and

Figure 7 is a functional block diagram of a demodulating position
control system 700.

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

The preferred embodiments of the invention are discussed in
detail below. While specific part numbers and/or configurations
are discussed, it should be understood that this is done for
illustration purposes only. A person skilled in the relevant art
will recognize that other components and configurations may be
used without parting from the spirit and scope of the invention.

The preferred embodiments of the invention 
are now described with reference to the figures 
where like reference numbers indicate like ele- 
ments. Also in the figures, the left most digit of 
each reference number corresponds to the figure in 
which the reference number is first used. 

Figure 1 is a high level block diagram of a voice-following video
conferencing system 100 (hereinafter, "video system 100"),
according to the present invention. Video system 100 includes a
digital video camera 102, a plurality of voice-activated emitters
104, a position control system 106, a microphone 108, a video
monitor 110, and a keypad 112. Also shown in Figure 1 is
communication link 114, which connects video system 100 with
similar remote systems for the transmission of audio, data, and
video information. Communication link 114 may be, for example, a
local area network, coaxial cable, a telephone transmission line,
or a fiber optic cable.

A video conferencing application with multiple participants is
illustrated to provide an understanding of the various features
and aspects of the invention. Other applications for the
invention include, for example, classroom, lecture hall, and
social functions.

In this preferred embodiment, as a participant talks, the
associated emitter 104 transmits, and position control system 106
receives, infrared (IR) signals. Each emitter 104 is in the form
of a badge, which may be attached to the clothing of a
participant. Figures 2A and 2B show the front view 200 and rear
view 202, respectively, of emitter 104. On the front side, a
microphone opening 206 and an IR lens 204 provide for the passage
of the voice energy and the IR signals, respectively. On the rear
side, emitter 104 has a garment clip 210, a battery compartment
door 208, and an on/off button 212. Each emitter 104 is a
self-powered unit worn by a participant. Emitter 104 may also be
produced in a form that is not worn, such as a table-top model,
which is placed in close proximity to each participant.

Video camera 102 has conventional camera features and an RS-232
serial port for control. In addition, an appropriate lens filter
is attached to reduce the amount of IR energy detected by the
charge coupled device (CCD) array. This is desirable because a
CCD can often resolve the normally invisible light, which may
cause a flash on the video monitor 110 when emitter 104 is
activated. This phenomenon could be distracting to users of the
conferencing system.

This preferred embodiment of the invention 
generally has two modes of operation. In a manual 
mode, the field of view of video camera 102 can be 
set to one or several participants, or on visual 
displays, such as a blackboard. Changes to the 
field of view, such as panning, tilting, and zooming 
are done manually through keypad 112. Keypad 
112 is hardwired to position control system 106. 
However, a conventional wireless remote control 
system may also be used. Also, position control 
system 106 can be programmed for a number of 
preset locations. Thus, video camera 102 can be 
positioned between these preset locations by 
pressing an associated button on keypad 112. This 
manual mode may be used in applications that do 
not require spontaneous and frequent movements 
of video camera 102. A business presentation with 
a small number of participants and a structured 
agenda is an example. 

For capturing conversations with free flowing interaction for
true face-to-face video conferencing, the voice-following mode is
used. In this mode, video system 100 will dynamically detect and
gracefully change the field of view of camera 102 toward the
active speaker among the participants. Each time a participant
starts speaking, their associated emitter 104 is activated to
transmit a signal (e.g., an IR signal) for the duration of time
that the participant is talking.

Correspondingly, position control system 106 is equipped with IR
sensors to detect the IR signal emitted by an emitter 104, and to
determine the location of the active speaker based on the IR
signal. Position control system 106 then pans and/or tilts video
camera 102 toward the active speaker, if the person is not
already in the field of view, and gracefully tracks the movement
of that person. Position control system 106 can also send
commands directly to video camera 102 for lens alignment. For
example, position control system 106 can send zoom and focus
commands to better view an active speaker. Heuristics and other
conventional programming techniques are employed by position
control system 106 to precisely determine the panning and tilting
movements to achieve graceful positioning.
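The text does not specify which heuristics produce graceful positioning. As one hedged illustration, the sketch below limits how far the pan (or tilt) angle may change per control update and ignores very small errors. The function names, the 2 degree step limit, and the 1 degree deadband are all assumptions made for this example, not values from the specification.

```python
def step_toward(current_deg, target_deg, max_step_deg=2.0, deadband_deg=1.0):
    """Return the next pan (or tilt) angle for one control update.

    Hypothetical heuristic: move at most max_step_deg per update, and do
    not move at all when the error is within deadband_deg, so the camera
    neither jumps abruptly nor jitters around the target.
    """
    error = target_deg - current_deg
    if abs(error) <= deadband_deg:
        return current_deg
    # Clamp the correction to the per-update step limit.
    step = max(-max_step_deg, min(max_step_deg, error))
    return current_deg + step


def pan_trajectory(start_deg, target_deg, updates, **limits):
    """List the angles visited over a number of control updates."""
    angles = [start_deg]
    for _ in range(updates):
        angles.append(step_toward(angles[-1], target_deg, **limits))
    return angles


if __name__ == "__main__":
    print(pan_trajectory(0.0, 5.0, 4))
```

A real controller would likely also shape acceleration; the clamp above is only the simplest form of rate limiting.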

Figure 3 is a high level block diagram of emitter 104. Emitter
104 includes an audio detector 300, a transmitter 302, and a
power source 304. Audio detector 300 is designed to detect voice
energy in a predetermined zone, such that it will be activated
only by the voice energy of the nearest participant (i.e.,
presumably, the person actually wearing the associated emitter
104). When audio detector 300 receives voice energy that exceeds
a predetermined threshold, audio detector 300 produces an ACTIVE
signal 301. For the duration of time that the participant talks
above the threshold, ACTIVE signal 301 is maintained.
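The threshold behavior described above can be sketched in software as follows. This is an illustrative model, not the circuit of the specification: `active_signal` is a hypothetical name, the input is assumed to be a sequence of already-measured voice levels, and the optional `hold_samples` parameter (which keeps the signal asserted briefly after the level drops) is an added assumption that the text does not mention.

```python
def active_signal(levels, threshold, hold_samples=0):
    """Model of the ACTIVE signal: True while voice level exceeds threshold.

    levels       -- sequence of measured voice levels (arbitrary units)
    threshold    -- the predetermined threshold
    hold_samples -- assumed extra samples to stay active after the level
                    falls below the threshold (0 reproduces the plain rule)
    """
    active = []
    remaining = 0  # samples for which the signal stays asserted
    for level in levels:
        if level > threshold:
            remaining = hold_samples + 1
        active.append(remaining > 0)
        if remaining > 0:
            remaining -= 1
    return active


if __name__ == "__main__":
    levels = [0.1, 0.9, 0.2, 0.1, 0.8]
    print(active_signal(levels, threshold=0.5))
    print(active_signal(levels, threshold=0.5, hold_samples=1))
```

With `hold_samples=0` the output follows the level exactly; a small hold value would bridge short pauses between words.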

In response to ACTIVE signal 301, transmitter 302 emits a
POSITION signal 303. In this preferred embodiment, POSITION
signal 303 is an IR signal. Since POSITION signal 303 emits from
the location of the active speaker, it acts as a beacon for
position control system 106 to locate and track the active
speaker. Conversely, when a participant stops speaking, ACTIVE
signal 301 and POSITION signal 303 are no longer generated. In
this manner, video camera 102 is pointed at the person currently
speaking and, as the speaker changes, so does the positioning of
video camera 102.

Figure 4 shows a more detailed functional block diagram of
emitter 104. At this functional level, transmitter 302 includes a
transmitter amplifier 406 and an IR light emitting diode (LED)
408. Power source 304 includes an on/off switch 410 and a battery
412. In response to ACTIVE signal 301, transmitter amplifier 406
electrically drives IR LED 408, thus generating POSITION signal
303.

Also, at this functional level, audio detector 300 includes a
directional microphone 400, a pre-amplifier 402 and a band-pass
filter/threshold detector 404 (filter/detector 404). To prevent
false activation of emitter 104 (i.e., prohibiting the trigger of
emitter 104 by ambient room noise or by the voice energy outside
of the predetermined zone), directional microphone 400,
microphone pre-amplifier 402, and filter/detector 404 are
electrically tuned to detect only energy in the human voice
frequency bandwidth. Furthermore, filter/detector 404 is designed
to ignore low amplitude voice energy. Thus, under normal
conditions emitter 104 is only responsive to the voice energy
from the predetermined zone.
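As a rough software analogue of the tuned filter/detector, the sketch below triggers only when a frame of audio is both loud enough and dominated by energy inside a voice band. The specification describes an analog circuit and gives no band edges or thresholds; the 300 to 3400 Hz band, the 0.8 in-band fraction, the 0.1 RMS floor, and all function names here are assumptions for illustration.

```python
import cmath
import math


def band_energy_fraction(samples, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Fraction of (non-DC) spectral power that lies inside [low_hz, high_hz]."""
    n = len(samples)
    total = 0.0
    in_band = 0.0
    for k in range(1, n // 2):  # one-sided spectrum, skipping the DC bin
        freq = k * sample_rate / n
        coeff = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                    for i, s in enumerate(samples))
        power = abs(coeff) ** 2
        total += power
        if low_hz <= freq <= high_hz:
            in_band += power
    return in_band / total if total > 0 else 0.0


def emitter_should_trigger(samples, sample_rate, min_fraction=0.8, min_rms=0.1):
    """True when the frame is loud enough and mostly within the voice band."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return (rms >= min_rms
            and band_energy_fraction(samples, sample_rate) >= min_fraction)


def tone(freq_hz, sample_rate=8000, n=200, amplitude=1.0):
    """Generate a test sine tone (helper for the demonstration only)."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)]


if __name__ == "__main__":
    print(emitter_should_trigger(tone(1000.0), 8000))                  # in band, loud
    print(emitter_should_trigger(tone(120.0), 8000))                   # low-frequency hum
    print(emitter_should_trigger(tone(1000.0, amplitude=0.05), 8000))  # too quiet
```

The direct DFT is used only to keep the example dependency-free; it is far slower than an FFT and is not meant for real-time use.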

Figure 5 is a functional block diagram of position control system
106. Position control system 106 includes a receiver 500, a
computer 502, a servo-controller 504, and an RS-232 interface
506. In its preferred embodiment, receiver 500 is an array of
three IR sensors. However, linear sensor arrays and other signal
detecting devices may alternatively be used.

Computer 502 is an NCR Model 3333 general purpose computer
available from NCR Corporation, Dayton, Ohio. Computer 502
includes the appropriate communication interfaces with receiver
500, servo-controller 504, and communication link 114. Computer
502 communicates with video camera 102 via RS-232 interface 506.

Servo-controller 504 is an electromechanical assembly in which
video camera 102 is mounted. Servo-controller 504 performs the
actual panning and tilting of video camera 102.

In the voice-following mode, the receiver 500 detects the IR
signal from an activated emitter 104 and generates sensor array
signal 508. By using linear programming and optical triangulation
techniques, computer 502 determines the location of the source of
the IR POSITION signal 303 (that is, the speaking participant).
Computer 502 then determines the appropriate pan, tilt, and zoom
adjustments and sends the corresponding commands to
servo-controller 504 and video camera 102 via the RS-232
interface 506 to bring/maintain the active speaker within the
field of view of video camera 102.
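The triangulation step is not detailed in the text. One common form, sketched below under stated assumptions, intersects two bearing rays from sensors at known positions and then converts the result to a pan angle. It is assumed here that each sensor can report a bearing to the emitter in a shared 2-D floor-plan coordinate frame; the function names and the coordinate convention (angles counter-clockwise from the +x axis) are hypothetical.

```python
import math


def triangulate(sensor_a, bearing_a_deg, sensor_b, bearing_b_deg):
    """Intersect two bearing rays to estimate the emitter position (x, y).

    sensor_a, sensor_b -- (x, y) positions of two IR sensors
    bearing_*_deg      -- direction from each sensor to the emitter,
                          measured counter-clockwise from the +x axis
    """
    ax, ay = sensor_a
    bx, by = sensor_b
    dax = math.cos(math.radians(bearing_a_deg))
    day = math.sin(math.radians(bearing_a_deg))
    dbx = math.cos(math.radians(bearing_b_deg))
    dby = math.sin(math.radians(bearing_b_deg))
    denom = dax * dby - day * dbx  # 2-D cross product of the directions
    if abs(denom) < 1e-9:
        raise ValueError("bearings are parallel; no unique intersection")
    # Solve sensor_a + t * dir_a == sensor_b + u * dir_b for t.
    t = ((bx - ax) * dby - (by - ay) * dbx) / denom
    return (ax + t * dax, ay + t * day)


def pan_angle_deg(camera_xy, target_xy):
    """Pan angle from the camera to the target, in the same angular frame."""
    return math.degrees(math.atan2(target_xy[1] - camera_xy[1],
                                   target_xy[0] - camera_xy[0]))


if __name__ == "__main__":
    emitter = triangulate((-1.0, 0.0), 45.0, (1.0, 0.0), 135.0)
    print(emitter, pan_angle_deg((0.0, 0.0), emitter))
```

With three sensors, as in the preferred embodiment, the extra bearing could be used to estimate height for tilt or to average out measurement error; that extension is not shown.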

In the preferred embodiment, computer 502 is programmed with
heuristics or other techniques to intelligently and gently track
the motion of the active speaker. For example, computer 502
resolves conflicts when two or more participants are concurrently
speaking by giving priority to the participant who was first
detected, and filtering out short bursts of voice energy, such as
when a participant is briefly responding to the active speaker.
Another example is the situation when none of the participants
are speaking: computer 502 is programmed to zoom out video camera
102 to provide a wide angle view of the conference.
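The three heuristics named above (priority to the first detected speaker, rejection of short bursts, and a wide view when nobody speaks) can be combined in a small selection routine. The sketch below is one possible reading, not the programming of computer 502: the frame-based input, the `min_frames` burst length, and the `WIDE` marker are assumptions introduced for the example.

```python
WIDE = "WIDE"  # hypothetical marker meaning "zoom out to a wide angle view"


def select_targets(active_history, min_frames=3):
    """Choose a camera target for each frame of emitter activity.

    active_history -- list of sets; each set holds the ids of emitters
                      whose position signal was seen in that frame
    min_frames     -- an emitter must be active this many consecutive
                      frames before it can take the camera (filters bursts)
    """
    run_length = {}  # consecutive active frames per emitter
    current = None   # emitter currently holding the camera
    targets = []
    for active in active_history:
        run_length = {e: run_length.get(e, 0) + 1 for e in active}
        if current is not None and current not in active:
            current = None  # the current speaker stopped talking
        if current is None:
            candidates = [e for e in sorted(active)
                          if run_length[e] >= min_frames]
            if candidates:
                # The longest-running candidate was detected first.
                current = max(candidates, key=lambda e: run_length[e])
        targets.append(current if current is not None else WIDE)
    return targets


if __name__ == "__main__":
    history = [set(), {"A"}, {"A"}, {"A"}, {"A", "B"}, {"A"},
               {"B"}, {"B"}, {"B"}, set()]
    print(select_targets(history))
```

In this sketch a single silent frame releases the camera; a practical system would probably add a release delay so that natural pauses do not cause a zoom-out.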

Additionally, computer 502 may be programmed to analyze the
output of video camera 102 to locate "head and shoulders" of the
active speaker for precision adjustments of the field of view.
Such a technique is discussed in commonly owned, co-pending U.S.
Patent Application No. 08/094,286, filed July 19, 1993, titled,
"Method and Means for Detecting People in Image Sequences", which
is incorporated herein by reference.



In an alternate preferred embodiment of the invention, emitter
104 may be replaced with voice-activated modulating emitter 600.
Emitter 600 is similar in operation to emitter 104; however, in
addition to acting like a beacon to allow location of the active
speaker, emitter 600 modulates the voice energy of the active
speaker into a MODULATED POSITION signal 605, which is
transmitted to a demodulating position control system 700. In
this alternate preferred embodiment, however, position control
system 700 includes a receiver/demodulator 708 for receiving
MODULATED POSITION signal 605 and demodulating it to recover the
voice energy representing the active speaker's voice. Such a
system eliminates the need for one or more centrally located
microphones. In addition, the fidelity of the voice signal may be
improved by eliminating many ambient room noises.

A block diagram of emitter 600 is shown in Figure 6. Emitter 600
includes an audio detector 602, a transmitter 604, a carrier
oscillator 608, a modulator 606, and a power source 610. Audio
detector 602 and transmitter 604 operate in a manner similar to
that of audio detector 300 and transmitter 302 of emitter 104
described above. In addition, however, audio detector 602 sends a
voice energy signal 603 to modulator 606. Modulator 606 receives
voice energy signal 603 and a carrier signal 607 from carrier
oscillator 608. Modulator 606 then modulates carrier signal 607
with voice energy signal 603 to produce a modulated voice signal
609.
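The specification does not say which modulation scheme modulator 606 uses. Purely as an illustration, the sketch below assumes amplitude modulation: the voice samples scale the amplitude of a cosine carrier. The function name, the `depth` parameter, and the sample-based representation are assumptions for this example.

```python
import math


def amplitude_modulate(voice, carrier_hz, sample_rate, depth=0.5):
    """Amplitude-modulate a cosine carrier with voice samples.

    voice       -- voice samples, expected in the range [-1, 1]
    carrier_hz  -- carrier frequency (the role of the carrier oscillator)
    sample_rate -- samples per second
    depth       -- modulation depth in (0, 1]; keeps the envelope positive
    """
    if not 0.0 < depth <= 1.0:
        raise ValueError("depth must be in (0, 1]")
    return [(1.0 + depth * v)
            * math.cos(2 * math.pi * carrier_hz * i / sample_rate)
            for i, v in enumerate(voice)]


if __name__ == "__main__":
    voice = [0.0, 1.0, -1.0, 0.0]
    print(amplitude_modulate(voice, carrier_hz=1.0, sample_rate=4.0))
```

An IR LED can only emit non-negative intensity, so a practical transmitter would add a bias or pulse the LED; that detail is outside this sketch.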

Transmitter 604 receives modulated voice signal 609 from
modulator 606. Transmitter 604 produces MODULATED POSITION signal
605 when enabled by ACTIVE signal 301 from audio detector 602.
MODULATED POSITION signal 605 is an IR signal which contains the
voice energy signal 603 representing the speech of the active
speaker.

A block diagram of position control system 700 is shown in Figure
7. Position control system 700 includes a receiver/demodulator
708, a computer 702, a servo-controller 704, and an RS-232
interface 706. Position control system 700 operates in a similar
manner as position control system 106. Accordingly,
receiver/demodulator 708 receives MODULATED POSITION signal 605
and generates a sensor array signal 710. In addition,
receiver/demodulator 708 decodes MODULATED POSITION signal 605
and generates received voice energy signal 712, which reflects
the voice energy of the active speaker.
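How receiver/demodulator 708 decodes the signal is not described. If the emitter used amplitude modulation (an assumption, since the scheme is not stated), the voice could be recovered with a simple envelope detector: rectify the received signal, average over one carrier cycle, and remove the carrier's constant term. The function name and parameters below are hypothetical.

```python
import math


def envelope_demodulate(signal, samples_per_carrier_cycle, depth=0.5):
    """Recover voice samples from an amplitude-modulated carrier.

    Assumes signal[i] = (1 + depth * voice) * cos(carrier phase) and that
    the voice changes slowly compared with the carrier.
    """
    n = samples_per_carrier_cycle
    rectified = [abs(s) for s in signal]
    voice = []
    for i in range(len(rectified) - n + 1):
        mean = sum(rectified[i:i + n]) / n
        # The mean of |cos| over one full cycle is 2/pi, so rescale by pi/2.
        envelope = mean * math.pi / 2
        voice.append((envelope - 1.0) / depth)
    return voice


if __name__ == "__main__":
    rate, carrier_hz = 10000, 100
    sent_voice = 0.4  # constant test level standing in for a voice sample
    received = [(1 + 0.5 * sent_voice)
                * math.cos(2 * math.pi * carrier_hz * i / rate)
                for i in range(300)]
    recovered = envelope_demodulate(received, rate // carrier_hz)
    print(recovered[0])
```

The moving average acts as a crude low-pass filter; the recovered value is close to, but not exactly, the transmitted level because the average is taken over discrete samples.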

While the invention has been particularly shown and described
with reference to several preferred embodiments thereof, it will
be understood by those skilled in the art that various changes in
form and details may be made therein without departing from the
spirit and scope of the invention as defined in the appended
claims.
Claims 



1. A video system for dynamically following an active speaker,
comprising:

(a) a video camera;

(b) voice-activated emitting means, associated with the active
speaker, for receiving voice energy from the active speaker and
for emitting a position signal in response to said voice energy;
and

(c) positioning means, responsive to said position signal, for
positioning said video camera to track the active speaker.



2. The system of claim 1, wherein said positioning means
comprises:

(a) receiving means for receiving said position signal;

(b) computing means for receiving said position signal from said
receiving means and for producing a control signal; and

(c) servo-controlling means, responsive to said control signal,
for positioning said video camera so that the active speaker is
in a field of view of said video camera,

wherein said receiving means, said computing means, and said
servo-controlling means are operably coupled to dynamically
detect and gracefully track the active speaker.

3. The system of claim 1, wherein said voice-activated emitting
means comprises:

(a) detecting means for sensing said voice energy and generating
an active signal upon said voice energy exceeding a predetermined
threshold; and

(b) transmitting means, responsive to said active signal, for
emitting said position signal.

4. The system of claim 3, wherein said detecting means comprises:

(a) a microphone;

(b) a pre-amplifier electrically coupled to said microphone; and

(c) a threshold detector electrically coupled to said
pre-amplifier.

5. The system of claim 3, wherein said transmitting means
comprises:

(a) an infrared light emitting diode (LED).

6. The system of claim 1, wherein said voice-activated emitting
means comprises:

(a) audio energy receiving means for receiving said voice energy;

(b) modulating means, electrically connected to said audio energy
receiving means, for encoding said voice energy in a carrier
signal to produce said position signal; and

(c) emitting means for emitting said position signal.

7. The system of claim 6, wherein said positioning means
comprises:

(a) decoding means for decoding said position signal on said
carrier signal to reproduce said voice energy.

8. The system of claim 1, wherein said voice-activated emitting
means comprises a badge worn on the body of the active speaker.

9. A video capture system for dynamically following an active
speaker, comprising:

(a) a video camera;

(b) a voice-activated emitter associated with the active speaker,
said voice-activated emitter transmitting a position signal in
response to voice energy received from the active speaker; and

(c) a position control system, responsive to said position
signal, to position said video camera to follow the active
speaker.

10. The system of claim 9, wherein said position control system
comprises:

(a) a receiver to receive said position signal;

(b) a computer to receive said position signal from said receiver
and to produce a control signal; and

(c) a servo-controller, responsive to said control signal, to
position said video camera so that the active speaker is in a
field of view of said video camera,

wherein said receiver, said computer and said servo-controller
are operably coupled to automatically and fluidly capture the
active speaker within the view of said video camera.

11. The system of claim 9, wherein said voice-activated emitter
comprises:

(a) an audio detector to sense said voice energy and generate an
active signal upon said voice energy exceeding a predetermined
threshold; and

(b) a transmitter to emit said position signal in response to
said active signal.

12. The system of claim 11, wherein said audio detector
comprises:

(a) a microphone;

(b) a pre-amplifier electrically coupled to said microphone; and

(c) a threshold detector electrically coupled to said
pre-amplifier and tuned to said predetermined threshold.

13. The system of claim 11, wherein said transmitter comprises:

(a) an infrared light emitting diode (LED).

14. The system of claim 9, wherein said voice-activated emitter
comprises:

(a) an audio energy receiver to receive said voice energy, said
audio energy receiver including a microphone, an audio amplifier,
and a threshold detector;

(b) a modulator, electrically connected to said audio energy
receiver, to encode said voice energy in a carrier signal to
produce said position signal; and

(c) an emitter to emit said position signal.

15. The system of claim 14, wherein said position control system
comprises:

(a) a demodulator to decode said position signal to reproduce
said voice energy.

16. The system of claim 9, wherein said voice-activated emitter
comprises a badge worn on the body of the active speaker.

17. A system for controlling a video camera to pan between a
plurality of people so that the camera dynamically tracks a
person currently speaking, the system comprising:

(a) a plurality of badges, each badge being associated with one
of the plurality of people, each badge including voice-activated
transmitter means for receiving voice energy and for emitting a
signal if said voice energy exceeds a threshold, wherein said
threshold is selected so that said signal is transmitted only by
a badge associated with the person currently speaking; and

(b) positioning means, responsive to said signal, for positioning
said video camera to track the person currently speaking.

18. The system of claim 17, wherein said voice-activated
transmitter means comprises:

(a) receiving means for receiving said voice energy; and

(b) transmitting means, responsive to said receiving means, for
transmitting said signal.

19. The system of claim 17, wherein said voice-activated
transmitter means comprises:

(a) an infrared light emitting source for emitting said signal.

20. A system for use with a video camera to automatically
position the video camera to follow an active speaker,
comprising:

a voice-activated emitter to receive voice energy from the active
speaker and to emit a position signal in response to said voice
energy; and

a camera position control system to receive said position signal
and to point the video camera in the direction of said emitter to
view the active speaker.

FIG. 2A: EMITTER BADGE (Front View)

FIG. 2B: EMITTER BADGE (Rear View)